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<210> 1 
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<212> DNA 
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<400> 1 

gccaggcgga 

tcatcaatgc 

ctcgtggctc 

cttgcgcgct 

gcctgggcga 

tgaccattgc 

acatcatggg 

ggattgacca 

gcccccgctc 

ccgctattga 

ccaacgactg 

gccagttcac 

tgtgggagga 

tctcggacgg 

ccccgctgac 

ccgacaagct 

gcggtgtgga 

tggacattga 

acagcgtcta 

ctgtggaccc 

tggacatcat 

tcctgggtac 

agggccgcgc 

gcgccgactt 

ccatgcacta 

aggagggcgt 

acgccgacgc 

accccgagat 

agtgggaggg 



gcgtatggct 
cgcgtcgttc 
cgcacgcaag 
ggacatcgtg 
tgtgactggt 
ccctcgctac 
cgagaaggtc 
cccctggttc 
cggcgctgac 
ggctgcccgc 
gcactccgcc 
caaggccaag 
ggctttcaag 
ctatgccaag 
gggaaagacc 
ggtgactgtg 
gctggacacc 
ggagtggaac 
cgccggcaag 
caccgccccc 
cctggccgcc 
cggcaaggcc 
caagggcgtg 
catgctggtg 
cggtaccgtg 
caccggcttc 
cctggccgcc 
ggtggccaac 
cctgctggag 



gttgcctcta 
ggtgtcaaga 
tccacctcgc 
atggttgctg 
ggcctgccta 
gaccagtacg 
cgctacttcc 
ctggccaagg 
tacctggaca 
gtgctgccct 
ctggtgcccg 
tcggtgctgg 
gacacgaagc 
gtttacactg 
tacaagaaga 
tcgcccaact 
gtcatccgcg 
cccaagaccg 
gccgccgcca 
ctgttcgcct 
ctgcccaaga 
gcctacgaga 
gtcaagttct 
ccctcgcgct 
cccgtggtag 
cacatgggcg 
accgtgcgcc 
tgcatcagcc 
gaggtggtgt 



ccagccgccc 
agaccgcgaa 
gctcggctgt 
ctgaggtcgc 
ttgagctggt 
ctgacgcctg 
actccatcaa 
tctggggcaa 
accacaagcg 
tcggccccgg 
tcctgctgaa 
ctatccacaa 
tgccccaggc 
aggccacccc 
tcaactggct 
acgcgaccga 
ccaagggcat 
acaagttcct 
aggaggccct 
tcatcggccg 
tcctggccac 
agctggtgaa 
cggcgcccct 
tcgagccctg 
cctccaccgg 
ccctgaaccc 
gtgccagcga 
aggacctgtc 
acggcaaggg 



cagcagcgcg 
ccagctgctg 
tactggtgcc 
cccttggtcc 
caagcgcggc 
ggacacctcg 
gaagggcgtg 
gaccggctcc 
cttcgccctg 
cgaggactgc 
ggacgagtac 
catcgccttc 
cgcctttgac 
catggaggag 
gaagggtggc 
gatcgctgcc 
tgagggcatt 
gtctgcgccc 
gcaggccgag 
cctggaggag 
ccccaaggtg 
cgccatcggc 
ggcgcacatg 
cggcctgatc 
cggcctggtc 
cgacaagctg 
ggtgtttgcg 
ctggtccaag 
cggcgtggcc 



cgtcctatcg 
cgtgagcttg 
actggtgcca 
aagacgggcg 
caccgcgtca 
gtggtcgtgg 
caccgcgtgt 
aagctgtacg 
ttctgcaagg 
gtcttcgtgg 
cagcccaagg 
cagggccgca 
aagctggcct 
gacgagaagc 
attatcgccg 
gatgccgccg 
gtgaacggca 
tacgaccaga 
ctgggcctgc 
cagaagggtg 
cagatcgcca 
accaagtaca 
ctcaccgccg 
cagctgcacg 
gacaccgtca 
gacgaggctg 
ggcggccgct 
cccgcccaga 
accgccaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 



- It 



aggaggagat caaggtgccc gttgccgaga 
acgcccccaa caccctgaag cccgtgtccg 
ccaaggtcgg caccaccgcc cccgccatgg 
cctcgcccgc cgccgccacc cccaaggtga 
ccaagcccaa gaccgctggc ctcaagctgg 
agaacggcgc tgcctccaac ggcaacggca 
ccaagcccct ggtctccgcc gccacccgca 
gcggcgacag catgagcggc tcgaccaaag 
gcggccgcca ccggcgagga gcaggcttgc 
gagcatggca ggcggaaacg tgtgtactga 
tgaagtcggt tttgcgagac cggagaaacg 
gtttcggttt tgcccaagtc caaaagaaga 
tgcgcggcgc ggcgaccacg ccggcccgtg 
tcggcggcaa gagggcgggt ctgtacactc 
gagagtttgc cattttgtgg gacgtgagcg 
cctccggcaa cggagaagtg ggaggcgctg 
ggatacataa gagcgtgtgg aatggtggta 
ggttttgttg aggtgcagac ggcaccgtcg 
cttgggctca tttttggtgc ccgtcgatga 
ggctcgaagc gagggtttgt ggaagtggag 
cttgactgga ggtacacaaa gtggagcgtg 
acccagtagt gtggaaagta gttggacctg 
tgagagagta acaaagatgg cacctgaaaa 



agatccccgg cgacctgccc gccgtgtcct 1800 
cctccgtgga gggcaacggc gccgccgcgc 1860 
gcgcgtggcg cgcgaccacc ccctcgggcc 1920 
ccacctacaa gcccgccctg cccgccaccg 1980 
ccggtgaggc ctccaccacc tcgacctcgg 2040 
acggtgcctc ggcctccaag acctcggctg 2100 
agtccgccta aagcggcagt agccgcagag 2160 
ctgtggcagg aacggctgta gcagcggcag 2220 
ggcagcgagg gcgatgagct tagcggccgt 2280 
aatgtggtgc atgagagtgt cgtgctgtaa 2340 
ccggtttggt tttgtagtgc agggcctgtg 2400 
gtaacgaaac tgtagcagta gcagagcact 2460 
cgcagcctgt cctgccctca gccttgtgat 2520 
catccattcc aggatttttg caggctgcct 2580 
gcgggacggc cgcgccgggc tctcctaccg 2640 
tagcccggtg accccccaat gtagaggatg 2700 
aaagaggagg ggcctgggtc gcccctcgat 2760 
gcgtcaaagg ccctcgcaag gcccgggtgc 2820 
tgagagattg gccagcggtt ttttgaggct 2880 
cgaggagggt tggagaaaga ggcggacatg 2940 
cgacggcacg gaggcattgg cggactattg 3000 
aattctttga gagtaccgcg cattaatccg 3060 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 3117 



<210> 2 
<211> 2124 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fragment of the 
complete sequence of cDNA coding for the GBSSI of 
Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1)..<2124) 

<400> 2 

atg get gtt gec tct acc age cgc ccc age age gcg cgt cct ate gtc 48 
Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
15 10 15 

ate aat gee gcg teg ttc ggt gtc aag aag acc gcg aac cag ctg ctg 96 
lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 



cgt gag ctt get cgt ggc tec gca cgc aag tec acc teg cgc teg get 144 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

gtt act ggt gee act ggt gee act tgc gcg ctg gac ate gtg atg gtt 192 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 

get get gag gtc gee cct tgg tec aag acg ggc ggc ctg ggc gat gtg 240 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 



1 



65 70 75 80 

act ggt ggc ctg cct att gag ctg gtc aag cgc ggc cac cgc gtc atg 288 
Thr Gly Gly Leu Pro He Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 

acc att gcc cct cgc tac gac cag tac get gac gec tgg gac acc teg 336 
Thr He Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 

gtg gtc gtg gac ate atg ggc gag aag gtc cgc tac ttc cac tec ate 384 
Val Val Val Asp He Met Gly Glu Lys Val Arg Tyr Phe His Ser He 
115 120 125 

aag aag ggc gtg cac cgc gtg tgg att gac cac ccc tgg ttc ctg gcc 432 
Lys Lys Gly Val His Arg Val Trp He Asp His Pro Trp Phe Leu Ala 
130 135 140 

aag gtc tgg ggc aag acc ggc tec aag ctg tac ggc ccc cgc tec ggc 480 
Lys Val Trp Gly Lys Thr Gly Ser Lys Leu Tyr Gly Pro Arg Ser Gly 
145 150 155 160 

get gac tac ctg gac aac cac aag cgc ttc gcc ctg ttc tgc aag gcc 528 
Ala Asp Tyr Leu Asp Asn His Lys Arg Phe Ala Leu Phe Cys Lys Ala 
165 170 175 

get att gag get gcc cgc gtg ctg ccc ttc ggc ccc ggc gag gac tgc 576 
Ala He Glu Ala Ala Arg Val Leu Pro Phe Gly Pro Gly Glu Asp Cys 
180 185 190 

gtc ttc gtg gcc aac gac tgg cac tec gcc ctg gtg ccc gtc ctg ctg 624 
Val Phe Val Ala Asn Asp Trp His Ser Ala Leu Val Pro Val Leu Leu 
195 200 205 

aag gac gag tac cag ccc aag ggc cag ttc acc aag gcc aag teg gtg 672 
Lys Asp Glu Tyr Gin Pro Lys Gly Gin Phe Thr Lys Ala Lys Ser Val 
210 215 220 

ctg get ate cac aac ate gcc ttc cag ggc cgc atg tgg gag gag get 720 
Leu Ala He His Asn He Ala Phe Gin Gly Arg Met Trp Glu Glu Ala 
225 230 235 240 

ttc aag gac acg aag ctg ccc cag gcc gcc ttt gac aag ctg gcc ttc 768 
Phe Lys Asp Thr Lys Leu Pro Gin Ala Ala Phe Asp Lys Leu Ala Phe 
245 250 255 

teg gac ggc tat gcc aag gtt tac act gag gcc acc ccc atg gag gag 816 
Ser Asp Gly Tyr Ala Lys Val Tyr Thr Glu Ala Thr Pro Met Glu Glu 
260 265 270 

gac gag aag ccc ccg ctg acg gga aag acc tac aag aag ate aac tgg 864 
Asp Glu Lys Pro Pro Leu Thr Gly Lys Thr Tyr Lys Lys He Asn Trp 
275 280 285 

ctg aag ggt ggc att ate gcc gcc gac aag ctg gtg act gtg teg ccc 912 
Leu Lys Gly Gly He He Ala Ala Asp Lys Leu Val Thr Val Ser Pro 
290 295 300 



aac tac gcg acc gag ate get gec gat gec gec ggc ggt gtg gag ctg 960 
Asn Tyr Ala Thr Glu lie Ala Ala Asp Ala Ala Gly Gly Val Glu Leu 
305 310 315 320 

gac acc gtc ate cgc gee aag ggc att gag ggc att gtg aac ggc atg 1008 
Asp Thr Val lie Arg Ala Lys Gly lie Glu Gly lie Val Asn Gly Met 
325 330 335 

gac att gag gag tgg aac ccc aag acc gac aag ttc ctg tct gcg ccc 1056 
Asp lie Glu Glu Trp Asn Pro Lys Thr Asp Lys Phe Leu Ser Ala Pro 
340 345 350 

tac gac cag aac age gtc tac gec ggc aag gec gec gee aag gag gee 1104 
Tyr Asp Gin Asn Ser Val Tyr Ala Gly Lys Ala Ala Ala Lys Glu Ala 
355 360 365 

ctg cag gee gag ctg ggc ctg cct gtg gac ccc acc gec ccc ctg ttc 1152 
Leu Gin Ala Glu Leu Gly Leu Pro Val Asp Pro Thr Ala Pro Leu Phe 
370 375 380 

gee ttc ate ggc cgc ctg gag gag cag aag ggt gtg gac ate ate ctg 1200 
Ala Phe lie Gly Arg Leu Glu Glu Gin Lys Gly Val Asp lie lie Leu 
385 390 395 400 

gec gee ctg ccc aag ate ctg gee acc ccc aag gtg cag ate gee ate 1248 
Ala Ala Leu Pro Lys lie Leu Ala Thr Pro Lys Val Gin lie Ala lie 
405 410 415 

ctg ggt acc ggc aag gee gee tac gag aag ctg gtg aac gec ate ggc 1296 
Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 

acc aag tac aag ggc cgc gee aag ggc gtg gtc aag ttc teg gcg ccc 1344 
Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

ctg gcg cac atg etc acc gee ggc gee gac ttc atg ctg gtg ccc teg 1392 
Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

cgc ttc gag ccc tgc ggc ctg ate cag ctg cac gee atg cac tac ggt 1440 
Arg Phe Glu Pro Cys Gly Leu lie Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

acc gtg ccc gtg gta gee tec acc ggc ggc ctg gtc gac acc gtc aag 1488 
Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

gag ggc gtc acc ggc ttc cac atg ggc gec ctg aac ccc gac aag ctg 1536 
Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

gac gag get gac gee gac gee ctg gee gec acc gtg cgc cgt gee age 1584 
Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

gag gtg ttt gcg ggc ggc cgc tac ccc gag atg gtg gee aac tgc ate 1632 
Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys lie 



1 



530 



535 



540 



age cag gac ctg tec tgg tec aag ccc gee cag aag tgg gag ggc ctg 1680 
Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

ctg gag gag gtg gtg tac ggc aag ggc ggc gtg gec acc gec aag aag 1728 
Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

gag gag ate aag gtg ccc gtt gec gag aag ate ccc ggc gac ctg ccc 1776 
Glu Glu lie Lys Val Pro Val Ala Glu Lys lie Pro Gly Asp Leu Pro 
580 585 590 

gec gtg tec tac gee ccc aac acc ctg aag ccc gtg tec gec tec gtg 1824 
Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

gag ggc aac ggc gee gee gcg ccc aag gtc ggc acc acc gee ccc gee 1872 
Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

atg ggc gcg tgg cgc gcg acc acc ccc teg ggc ccc teg ccc gee gee 1920 
Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

gee acc ccc aag gtg acc acc tac aag ccc gee ctg ccc gee acc gee 1968 
Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 655 

aag ccc aag acc get ggc etc aag ctg gec ggt gag gee tec acc acc 2016 
Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

teg acc teg gag aac ggc get gee tec aac ggc aac ggc aac ggt gee 2064 
Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

teg gee tec aag acc teg get gee aag ccc ctg gtc tec gec gec acc 2112 
Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 

cgc aag tec gee 2124 

Arg Lys Ser Ala 

705 



<210> 3 
<211> 708 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid sequence 
of the GBSSI of Chlamydomonas reinhardtii 



<400> 3 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro He Val 
15 10 15 

He Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 

Thr Gly Gly Leu Pro He Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 

Thr He Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 

Val Val Val Asp He Met Gly Glu Lys Val Arg Tyr Phe His Ser He 
115 120 125 

Lys Lys Gly Val His Arg Val Trp He Asp His Pro Trp Phe Leu Ala 
130 135 140 

Lys Val Trp Gly Lys Thr Gly Ser Lys Leu Tyr Gly Pro Arg Ser Gly 
145 150 155 160 

Ala Asp Tyr Leu Asp Asn His Lys Arg Phe Ala Leu Phe Cys Lys Ala 
165 170 175 

Ala He Glu Ala Ala Arg Val Leu Pro Phe Gly Pro Gly Glu Asp Cys 
180 185 190 

Val Phe Val Ala Asn Asp Trp His Ser Ala Leu Val Pro Val Leu Leu 
195 200 205 

Lys Asp Glu Tyr Gin Pro Lys Gly Gin Phe Thr Lys Ala Lys Ser Val 
210 215 220 

Leu Ala He His Asn He Ala Phe Gin Gly Arg Met Trp Glu Glu Ala 
225 230 235 240 

Phe Lys Asp Thr Lys Leu Pro Gin Ala Ala Phe Asp Lys Leu Ala Phe 
245 250 255 

Ser Asp Gly Tyr Ala Lys Val Tyr Thr Glu Ala Thr Pro Met Glu Glu 
260 265 270 

Asp Glu Lys Pro Pro Leu Thr Gly Lys Thr Tyr Lys Lys lie Asn Trp 
275 280 285 



Leu Lys Gly Gly He He Ala Ala Asp Lys Leu Val Thr Val Ser Pro 
290 295 300 



Asn Tyr Ala Thr Glu lie Ala Ala Asp Ala Ala Gly Gly Val Glu Leu 
305 310 315 320 

Asp Thr Val lie Arg Ala Lys Gly lie Glu Gly lie Val Asn Gly Met 
325 330 335 

Asp lie Glu Glu Trp Asn Pro Lys Thr Asp Lys Phe Leu Ser Ala Pro 
340 345 350 

Tyr Asp Gin Asn Ser Val Tyr Ala Gly Lys Ala Ala Ala Lys Glu Ala 
355 360 365 

Leu Gin Ala Glu Leu Gly Leu Pro Val Asp Pro Thr Ala Pro Leu Phe 
370 375 380 

Ala Phe He Gly Arg Leu Glu Glu Gin Lys Gly Val Asp He He Leu 
385 390 395 400 

Ala Ala Leu Pro Lys He Leu Ala Thr Pro Lys Val Gin He Ala He 
405 410 415 

Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala He Gly 
420 425 430 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

Arg Phe Glu Pro Cys Gly Leu He Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys He 
530 535 540 

Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

Glu Glu He Lys Val Pro Val Ala Glu Lys He Pro Gly Asp Leu Pro 
580 585 590 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 



610 



615 



620 



Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 655 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 



Arg Lys Ser Ala 
705 



<210> 4 
<211> 1953 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fragment of the 
complete sequence of cDNA coding for the GBSSI of 
Chlamydomonas reinhardtii and coding for the mature 
GBSSI protein 

<220> 

<221> CDS 

<222> (1)..(1953) 

<400> 4 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 48 
Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 



gac cac ccc tgg ttc ctg gcc aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gcc ctg ttc tgc aag gcc get att gag get gcc cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gcc aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gcc ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gcc aag teg gtg ctg get ate cac aac ate gcc ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gcc 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gcc ttt gac aag ctg gcc ttc teg gac ggc tat gcc aag gtt tac act 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 



305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 1344 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc ggc gac ctg ccc gcc gtg tec tac gcc ccc aac acc ctg 1632 
Lys He Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 



aag ccc gtg tec gec tec gtg gag ggc aac ggc gec gec gcg ccc aag 1680 
Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 



gtc ggc acc acc gec ccc gec atg ggc gcg tgg cgc gcg acc acc ccc 1728 
Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 
565 570 575 

teg ggc ccc teg ccc gec gee gee acc ccc aag gtg acc acc tac aag 1776 
Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 

ccc gee ctg ccc gec acc gee aag ccc aag acc get ggc etc aag ctg 1824 
Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

gee ggt gag gec tec acc acc teg acc teg gag aac ggc get gee tec 1872 
Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

aac ggc aac ggc aac ggt gee teg gec tec aag acc teg get gee aag 1920 
Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

ccc ctg gtc tec gee gec acc cgc aag tec gee t 1953 

Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 5 
<211> 651 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid sequence 
of the GBSSI of Chlamydomonas reinhardtii and the 
mature GBSSI protein 

<400> 5 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 



Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 



85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 



1 



Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

Lys lie Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 

Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 

Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 
565 570 575 

Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 

Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 6 
<211> 1314 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fragment of the 



complete cDNA coding for the GBSSI of Chlamydomonas 
reinhardtii 



<220> 

<221> CDS 

<222> (1)..(1314) 



<400> 6 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cct tgg tec aag 48 
Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gee tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gee get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gec aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gec 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 



gee ttt gac aag ctg gee ttc teg gac ggc tat gec aag gtt tac act 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 



624 



195 200 205 

gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 



ggc ctg gtc gac acc gtc 
Gly Leu Val Asp Thr Val 
435 



1314 



<210> 7 
<211> 438 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid sequence 
of the GBSSI of Chlamydomonas reinhardtii 

<400> 7 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 



Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 . 350 

Pro Lys Val Gin He Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val 
435 



<210> 8 
<211> 1593 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fragment of the 
complete cDNA coding for the GBSSI of Chlamydomonas 
reinhardtii 



<220> 

<221> CDS 

<222> (1)..(1593) 



<400> 8 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 48 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg ace att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac ace teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag ace ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gec get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc ace aag gee aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gee 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gee ttt gac aag ctg gee ttc teg gac ggc tat gee aag gtt tac act 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gee ace ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 



acc tac aag aag 
Thr Tyr Lys Lys 
225 

aag ctg gtg act 
Lys Leu Val Thr 



gcc gcc ggc ggt 
Ala Ala Gly Gly 
260 

gag ggc att gtg 
Glu Gly He Val 
275 

gac aag ttc ctg 
Asp Lys Phe Leu 
290 

aag gcc gcc gcc 
Lys Ala Ala Ala 
305 

gac ccc acc gcc 
Asp Pro Thr Ala 



aag ggt gtg gac 
Lys Gly Val Asp 
340 

ccc aag gtg cag 
Pro Lys Val Gin 
355 

aag ctg gtg aac 
Lys Leu Val Asn 
370 

gtg gtc aag ttc 
Val Val Lys Phe 
385 

gac ttc atg ctg 
Asp Phe Met Leu 



ctg cac gcc atg 
Leu His Ala Met 
420 

ggc ctg gtc gac 
Gly Leu Val Asp 
435 

gcc ctg aac ccc 
Ala Leu Asn Pro 



ate aac tgg 
He Asn Trp 
230 

gtg teg ccc 
Val Ser Pro 
245 

gtg gag ctg 
Val Glu Leu 



aac ggc atg 
Asn Gly Met 



tct gcg ccc 
Ser Ala Pro 
295 

aag gag gcc 
Lys Glu Ala 
310 

ccc ctg ttc 
Pro Leu Phe 
325 

ate ate ctg 
He He Leu 



ate gcc ate 
He Ala He 



gcc ate ggc 
Ala He Gly 
375 

teg gcg ccc 
Ser Ala Pro 
390 

gtg ccc teg 
Val Pro Ser 
405 

cac tac ggt 
His Tyr Gly 



acc gtc aag 
Thr Val Lys 



gac aag ctg 
Asp Lys Leu 



ctg aag ggt ggc 
Leu Lys Gly Gly 
235 

aac tac gcg acc 
Asn Tyr Ala Thr 
250 

gac acc gtc ate 
Asp Thr Val He 
265 

gac att gag gag 
Asp He Glu Glu 
280 

tac gac cag aac 
Tyr Asp Gin Asn 



ctg cag gcc gag 
Leu Gin Ala Glu 
315 

gcc ttc ate ggc 
Ala Phe He Gly 
330 

gcc gcc ctg ccc 
Ala Ala Leu Pro 
345 

ctg ggt acc ggc 
Leu Gly Thr Gly 
360 

acc aag tac aag 
Thr Lys Tyr Lys 



ctg gcg cac atg 
Leu Ala His Met 
395 

cgc ttc gag ccc 
Arg Phe Glu Pro 
410 

acc gtg ccc gtg 
Thr Val Pro Val 
425 

gag ggc gtc acc 
Glu Gly Val Thr 
440 

gac gag get gac 
Asp Glu Ala Asp 



att ate gcc 
He He Ala 



gag ate get 
Glu He Ala 



cgc gcc aag 
Arg Ala Lys 
270 

tgg aac ccc 
Trp Asn Pro 
285 

age gtc tac 
Ser Val Tyr 
300 

ctg ggc ctg 
Leu Gly Leu 



cgc ctg gag 
Arg Leu Glu 



aag ate ctg 
Lys He Leu 
350 

aag gcc gcc 
Lys Ala Ala 
365 

ggc cgc gcc 
Gly Arg Ala 
380 

etc acc gcc 
Leu Thr Ala 



tgc ggc ctg 
Cys Gly Leu 



gta gcc tec 
Val Ala Ser 
430 

ggc ttc cac 
Gly Phe His 
445 

gcc gac gcc 
Ala Asp Ala 



gcc gac 720 
Ala Asp 
240 

gcc gat 768 
Ala Asp 
255 

ggc att 816 
Gly He 



aag acc 864 
Lys Thr 



gcc ggc 912 
Ala Gly 



cct gtg 960 
Pro Val 
320 

gag cag 1008 

Glu Gin 

335 

gcc acc 1056 
Ala Thr 



tac gag 1104 
Tyr Glu 



aag ggc 1152 
Lys Gly 



ggc gcc 1200 
Gly Ala 
400 

ate cag 1248 

He Gin 

415 

acc ggc 1296 
Thr Gly 



atg ggc 1344 
Met Gly 



ctg gcc 1392 
Leu Ala 



450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc 1593 
Lys He Pro 
530 



<210> 9 
<211> 531 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Amino acid sequence 
of the GBSSI of Chlamydomonas reinhardtii 

<400> 9 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 HO 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 



Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 



Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 



Ala Leu Asn Pro Asp Lys Leu Asp 
450 455 

Ala Thr Val Arg Arg Ala Ser Glu 
465 470 

Glu Met Val Ala Asn Cys lie Ser 
485 

Ala Gin Lys Trp Glu Gly Leu Leu 
500 

Gly Val Ala Thr Ala Lys Lys Glu 
515 520 

Lys lie Pro 
530 



Glu Ala Asp Ala Asp Ala Leu Ala 
460 

Val Phe Ala Gly Gly Arg Tyr Pro 
475 480 

Gin Asp Leu Ser Trp Ser Lys Pro 
490 495 

Glu Glu Val Val Tyr Gly Lys Gly 
505 510 

Glu He Lys Val Pro Val Ala Glu 
525 



<210> 10 
<211> 1696 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: cDNA coding for the 
carboxy terminal sequence for the GBSSI of 
Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (3). .(716) 

<400> 10 

at teg gca cga gtg cac gec atg cac tac ggt acc gtg ccc gtg gta 47 
Ser Ala Arg Val His Ala Met His Tyr Gly Thr Val Pro Val Val 
15 10 15 

gec tec acc ggc ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc 95 
Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly 
20 25 30 

ttc cac atg ggc gec ctg aac ccc gac aag ctg gac gag get gac gec 143 
Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala 
35 40 45 

gac gec ctg gee gee acc gtg cgc cgt gec age gag gtg ttt gcg ggc 191 
Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly 
50 55 60 

ggc cgc tac ccc gag atg gtg gee aac tgc ate age cag gac ctg tec 239 
Gly Arg Tyr Pro Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser 
65 70 75 



tgg tec 
Trp Ser 
80 



aag ccc gec cag aag tgg gag ggc ctg ctg 
Lys Pro Ala Gin Lys Trp Glu Gly Leu Leu 
85 90 



gag gag gtg gtg 
Glu Glu Val Val 
95 



287 



tac ggc aag ggc ggc gtg gec acc gec aag aag gag gag ate aag gtg 335 
Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val 
100 105 110 



ccc gtt gec gag aag ate ccc ggc gac ctg ccc gee gtg tec tac gee 
Pro Val Ala Glu Lys lie Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala 
115 120 125 



gec gcg ccc aag gtc ggc acc acc gec ccc gee atg ggc gcg tgg cgc 
Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg 
145 150 155 



ggc etc aag ctg gee ggt gag gee tec acc acc teg acc teg gag aac 
Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn 
195 200 205 



teg get gec aag ccc ctg gtc tec gee gee acc cgc aag tec gee 
Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
225 230 235 



383 



ccc aac acc ctg aag ccc gtg tec gee tec gtg gag ggc aac ggc gee 431 
Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala 
130 135 140 



479 



gcg acc acc ccc teg ggc ccc teg ccc gee gec gec acc ccc aag gtg 527 
Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val 
160 165 170 175 

acc acc tac aag ccc gee ctg ccc gee acc gee aag ccc aag acc get 575 
Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala 
180 185 190 



623 



ggc get gee tec aac ggc aac ggc aac ggt gee teg gee tec aag acc 671 
Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr 
210 215 220 



716 



taaageggea 


gtagccgcag 


aggeggegae 


ageatgageg 


gctcgaccaa 


agctgtggca 


776 


ggaacggctg 


tagcagegge 


aggcggccgc 


caccggcgag 


gagcaggctt 


geggcagega 


836 


gggegatgag 


cttagcggcc 


gtgagcatgg 


caggeggaaa cgtgtgtact 


gaaatgtggt 


896 


gcatgagagt 


gtcgtgctgt 


aatgaagtcg 


gttttgegag 


aceggagaaa 


cgccggtttg 


956 


gttttgtagt 


geagggectg 


tggtttcggt 


tttgcccaag 


tccaaaagaa 


gagtaacgaa 


1016 


actgtagcag 


tagcagagca 


cttgcgcggc 


gcggcgacca 


cgccggcccg 


tgcgcagcct 


1076 


gtcctgccct 


cagccttgtg 


atteggegge 


aagagggegg 


gtctgtacac 


tccatccatt 


1136 


ccaggatttt 


tgcaggctgc 


ctgagagttt 


gecattttgt 


gggacgtgag 


eggegggacg 


1196 


gccgcgccgg 


gctctcctac 


cgcctccggc 


aacggagaag tgggaggege 


tgtagecegg 


1256 



tgacccccca 


atgtagagga 


tgggatacat 


ggggcctggg 


tcgcccctcg 


atggttttgt 


ggccctcgca 


aggcccgggt 


gccttgggct 


tggccagcgg 


ttttttgagg 


ctggctcgaa 


gx. tggagaaa 


yay y Cy y cH-a 




cggaggcatt 


ggcggactat 


tgacccagta 


gagagtaccg 


cgcattaatc 


cgtgagagag 


aaaaaaaaaa 


aaaaaaaaaa 





aagagcgtgt ggaatggtgg taaaagagga 1316 
tgaggtgcag acggcaccgt cggcgtcaaa 1376 
catttttggt gcccgtcgat gatgagagat 1436 
gcgagggttt gtggaagtgg agcgaggagg 1496 
gaggtacaca aagtggagcg tgcgacggca 1556 
gtgtggaaag tagttggacc tgaattcttt 1616 
taacaaagat ggcacctgaa aaaaaaaaaa 1676 

1696 



<210> 11 
<211> 1696 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: cDNA coding for the 
carboxy terminal sequence for the GBSSI of Chlamydomonas 
reinhardtii 

<400> 11 

tttttttttt tttttttttt tttttttttt ttcaggtgcc atctttgtta ctctctcacg 60 
gattaatgcg cggtactctc aaagaattca ggtccaacta ctttccacac tactgggtca 120 
atagtccgcc aatgcctccg tgccgtcgca cgctccactt tgtgtacctc cagtcaagca 180 
tgtccgcctc tttctccaac cctcctcgct ccacttccac aaaccctcgc ttcgagccag 240 
cctcaaaaaa ccgctggcca atctctcatc atcgacgggc accaaaaatg agcccaaggc 300 
acccgggcct tgcgagggcc tttgacgccg acggtgccgt ctgcacctca acaaaaccat 360 
cgaggggcga cccaggcccc tcctctttta ccaccattcc acacgctctt atgtatccca 420 
tcctctacat tggggggtca ccgggctaca gcgcctccca cttctccgtt gccggaggcg 480 
gtaggagagc ccggcgcggc cgtcccgccg ctcacgtccc acaaaatggc aaactctcag 540 
gcagcctgca aaaatcctgg aatggatgga gtgtacagac ccgccctctt gccgccgaat 600 
cacaaggctg agggcaggac aggctgcgca cgggccggcg tggtcgccgc gccgcgcaag 660 
tgctctgcta ctgctacagt ttcgttactc ttcttttgga cttgggcaaa accgaaacca 720 
caggccctgc actacaaaac caaaccggcg tttctccggt ctcgcaaaac cgacttcatt 780 
acagcacgac actctcatgc accacatttc agtacacacg tttccgcctg ccatgctcac 840 
ggccgctaag ctcatcgccc tcgctgccgc aagcctgctc ctcgccggtg gcggccgcct 900 
gccgctgcta cagccgttcc tgccacagct ttggtcgagc cgctcatgct gtcgccgcct 960 
ctgcggctac tgccgcttta ggcggacttg cgggtggcgg cggagaccag gggcttggca 1020 
gccgaggtct tggaggccga ggcaccgttg ccgttgccgt tggaggcagc gccgttctcc 1080 
gaggtcgagg tggtggaggc ctcaccggcc agcttgaggc cagcggtctt gggcttggcg 1140 
gtggcgggca gggcgggctt gtaggtggtc accttggggg tggcggcggc gggcgagggg 1200 
cccgaggggg tggtcgcgcg ccacgcgccc atggcggggg cggtggtgcc gaccttgggc 1260 
gcggcggcgc cgttgccctc cacggaggcg gacacgggct tcagggtgtt gggggcgtag 1320 
gacacggcgg gcaggtcgcc ggggatcttc tcggcaacgg gcaccttgat ctcctccttc 1380 
ttggcggtgg ccacgccgcc cttgccgtac accacctcct ccagcaggcc ctcccacttc 1440 
tgggcgggct tggaccagga caggtcctgg ctgatgcagt tggccaccat ctcggggtag 1500 
cggccgcccg caaacacctc gctggcacgg cgcacggtgg cggccagggc gtcggcgtca 1560 
gcctcgtcca gcttgtcggg gttcagggcg cccatgtgga agccggtgac gccctccttg 1620 
acggtgtcga ccaggccgcc ggtggaggct accacgggca cggtaccgta gtgcatggcg 1680 



tgcactcgtg ccgaat 



1696 



<210> 12 
<211> 238 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Carboxy terminal 
peptide sequence of the GBSSI of Chlamydomonas 
reinhardtii 

<400> 12 

Ser Ala Arg Val His Ala Met His Tyr Gly Thr Val Pro Val Val Ala 
15 10 15 

Ser Thr Gly Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe 
20 25 30 

His Met Gly Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp 
35 40 45 

Ala Leu Ala Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly 
50 55 60 

Arg Tyr Pro Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp 
65 70 75 80 

Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr 
85 90 95 

Gly Lys Gly Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro 
100 105 110 

Val Ala Glu Lys He Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro 
115 120 125 

Asn Thr Leu Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala 
130 135 140 

Ala Pro Lys Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala 
145 150 155 160 

Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr 
165 170 175 

Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly 
180 185 190 

Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly 
195 200 205 

Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser 
210 215 220 



Ala Ala Lys Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
225 230 235 



<210> 13 
<211> 23 
<212> PRT 

<213> Artificial Sequence 



<220> . 
<223> Description of Artificial Sequence: Ammo terminal 
fragment of GBSSI of Chlamydomonas reinhardtii 



<400> 13 _ 
Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Gly Gly Lys 
15 10 15 



Thr Gly Gly Leu Gly Asp Val 
20 



<210> 14 
<211> 23 
<212> PRT 

<213> Artificial Sequence 



<220> . 
<223> Description of Artificial Sequence: Amino terminal 
fragment of GBSSI of Chlamydomonas reinhardtii 



<400> 14 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
I 5 10 15 



Thr Gly Gly Leu Gly Asp Val 
20 



<210> 15 
<211> 3117 
<212> DNA 

<213> Chlamydomonas reinhardtii 
<400> 15 

tttttttttt tttttttttt tttttttttt 
attaatgcgc ggtactctca aagaattcag 
tagtccgcca atgcctccgt gccgtcgcac 
gtccgcctct ttctccaacc ctcctcgctc 
ctcaaaaaac cgctggccaa tctctcatca 
cccgggcctt gcgagggcct ttgacgccga 
gaggggcgac ccaggcccct cctcttttac 
cctctacatt ggggggtcac cgggctacag 
taggagagcc cggcgcggcc gtcccgccgc 
cagcctgcaa aaatcctgga atggatggag 
acaaggctga gggcaggaca ggctgcgcac 
gctctgctac tgctacagtt tcgttactct 
aggccctgca ctacaaaacc aaaccggcgt 



tcaggtgcca tctttgttac tctctcacgg 60 
gtccaactac tttccacact actgggtcaa 120 
gctccacttt gtgtacctcc agtcaagcat 180 
cacttccaca aaccctcgct tcgagccagc 240 
tcgacgggca ccaaaaatga gcccaaggca 300 
cggtgccgtc tgcacctcaa caaaaccatc 360 
caccattcca cacgctctta tgtatcccat 420 
cgcctcccac ttctccgttg ccggaggcgg 480 
tcacgtccca caaaatggca aactctcagg 540 
tgtacagacc cgccctcttg ccgccgaatc 600 
gggccggcgt ggtcgccgcg ccgcgcaagt 660 
tcttttggac ttgggcaaaa ccgaaaccac 720 
ttctccggtc tcgcaaaacc gacttcatta 780 



cagcacgaca ctctcatgca ccacatttca 
gccgctaagc tcatcgccct cgctgccgca 
ccgctgctac agccgttcct gccacagctt 
tgcggctact gccgctttag gcggacttgc 
ccgaggtctt ggaggccgag gcaccgttgc 
aggtcgaggt ggtggaggcc tcaccggcca 
tggcgggcag ggcgggcttg taggtggtca 
ccgagggggt ggtcgcgcgc cacgcgccca 
cggcggcgcc gttgccctcc acggaggcgg 
acacggcggg caggtcgccg gggatcttct 
tggcggtggc cacgccgccc ttgccgtaca 
gggcgggctt ggaccaggac aggtcctggc 
ggccgcccgc aaacacctcg ctggcacggc 
cctcgtccag cttgtcgggg ttcagggcgc 
cggtgtcgac caggccgccg gtggaggcta 
gcagctggat caggccgcag ggctcgaagc 
cggtgagcat gtgcgccagg ggcgccgaga 
acttggtgcc gatggcgttc accagcttct 
cgatctgcac cttgggggtg gccaggatct 
ccttctgctc ctccaggcgg ccgatgaagg 
ggcccagctc ggcctgcagg gcctccttgg 
ggtcgtaggg cgcagacagg aacttgtcgg 
cgttcacaat gccctcaatg cccttggcgc 
cggcatcggc agcgatctcg gtcgcgtagt 
cgataatgcc acccttcagc cagttgatct 
tctcgtcctc ctccatgggg gtggcctcag 
ccagcttgtc aaaggcggcc tggggcagct 
ggccctggaa ggcgatgttg tggatagcca 
tgggctggta ctcgtccttc agcaggacgg 
cgaagacgca gtcctcgccg gggccgaagg 
tgcagaacag ggcgaagcgc ttgtggttgt 
acagcttgga gccggtcttg ccccagacct 
cgcggtgcac gcccttcttg atggagtgga 
cgaccaccga ggtgtcccag gcgtcagcgt 
cgcggtggcc gcgcttgacc agctcaatag 
ccgtcttgga ccaaggggcg acctcagcag 
caccagtggc accagtaaca gccgagcgcg 
gctcacgcag cagctggttc gcggtcttct 
taggacgcgc gctgctgggg cggctggtag 



gtacacacgt ttccgcctgc catgctcacg 840 
agcctgctcc tcgccggtgg cggccgcctg 900 
tggtcgagcc gctcatgctg tcgccgcctc 960 
gggtggcggc ggagaccagg ggcttggcag 1020 
cgttgccgtt ggaggcagcg ccgttctccg 1080 
gcttgaggcc agcggtcttg ggcttggcgg 1140 
ccttgggggt ggcggcggcg ggcgaggggc 1200 
tggcgggggc ggtggtgccg accttgggcg 1260 
acacgggctt cagggtgttg ggggcgtagg 1320 
cggcaacggg caccttgatc tcctccttct 1380 
ccacctcctc cagcaggccc tcccacttct 1440 
tgatgcagtt ggccaccatc tcggggtagc 1500 
gcacggtggc ggccagggcg tcggcgtcag 1560 
ccatgtggaa gccggtgacg ccctccttga 1620 
ccacgggcac ggtaccgtag tgcatggcgt 1680 
gcgagggcac cagcatgaag tcggcgccgg 1740 
acttgaccac gcccttggcg cggcccttgt 1800 
cgtaggcggc cttgccggta cccaggatgg 1860 
tgggcagggc ggccaggatg atgtccacac 1920 
cgaacagggg ggcggtgggg tccacaggca 1980 
cggcggcctt gccggcgtag acgctgttct 2040 
tcttggggtt ccactcctca atgtccatgc 2100 
ggatgacggt gtccagctcc acaccgccgg 2160 
tgggcgacac agtcaccagc ttgtcggcgg 2220 
tcttgtaggt ctttcccgtc agcgggggct 2280 
tgtaaacctt ggcatagccg tccgagaagg 2340 
tcgtgtcctt gaaagcctcc tcccacatgc 2400 
gcaccgactt ggccttggtg aactggccct 2460 
gcaccagggc ggagtgccag tcgttggcca 2520 
gcagcacgcg ggcagcctca atagcggcct 2580 
ccaggtagtc agcgccggag cgggggccgt 2640 
tggccaggaa ccaggggtgg tcaatccaca 2700 
agtagcggac cttctcgccc atgatgtcca 2760 
actggtcgta gcgaggggca atggtcatga 2820 
gcaggccacc agtcacatcg cccaggccgc 2880 
caaccatcac gatgtccagc gcgcaagtgg 2940 
aggtggactt gcgtgcggag ccacgagcaa 3000 
tgacaccgaa cgacgcggca ttgatgacga 3060 
aggcaacagc catacgctcc gcctggc 3117 



